Dynamic Role Assignment using General Value Functions
نویسندگان
چکیده
Collecting and maintaining accurate world knowledge in a dynamic, complex, competitive, and stochastic environment such as RoboCup 3D Soccer Simulation is a challenging task. Knowledge should be learned in real-time with time constraints. We use recently introduced Off-Policy Gradient Descent algorithms in Reinforcement Learning that illustrate learnable knowledge representations for dynamic role assignments. The results show that the agents have learned competitive policies against the top teams from RoboCup 2012 for three vs three, five vs five, and seven vs seven agents. We have explicitly used subset of agents to identify the dynamics and the semantics for which the agents learn to maximize their performance measures, and to gather knowledge about different objectives so that all agents participate effectively within the team.
منابع مشابه
Implementation of VAT on Iran banking services in the context of dynamic stochastic general equilibrium model
In the Value Added Tax (VAT) system some goods and services, such as banking services, are exempted from taxes. Based on theoretical foundations, exempt treatment leads to several distortions and inefficiencies in the economy. In order to understand the importance of exemption on macroeconomic fluctuations as well as the fundamental role of financial intermediaries in economy shocks, this study...
متن کاملDynamic Labor Market in a Dynamic Stochastic General Equilibrium Model: Case Study of Iranian Economy
The labor market, as one of the four markets, plays an important role in economic growth and development. So review developments in the labor market because of its close relationship with developments in other sectors is of great importance. This study tries to examine the dynamics of the labor market by adjusting for a New Keynesian dynamic stochastic general equilibrium model for the Iranian ...
متن کاملFolding of Tagged Single Assignment Values for Memory-Efficient Parallelism
The dynamic-single-assignment property for shared data accesses can establish data race freedom and determinism in parallel programs. However, memory management is a well known challenge in making dynamic-single-assignment practical, especially when objects can be accessed through tags that can be computed by any step. In this paper, we propose a new memory management approach based on user-spe...
متن کاملAppropriate Labor income and Capital gain tax rates functions extraction based on Overlapping Generation Models: Dynamic Stochastic General Equilibrium (DSGE) approach
In this study, using the overlapping generation (OLG (model and the Stochastic Dynamic General Equilibrium (DSGE) approach, the optimal form of labor income tax rate and capital income tax functions is extracted for the economy of Iran using annual time series data during 1357 to 1397. The results of comparing the calibration and simulation of the designed model show that the optimal functions ...
متن کاملEffect of Sentiments on Macroeconomic Variables in Iran: A Dynamic Stochastic General Equilibrium Approach
This study aims to evaluate the effect of sentiments on Iran's economy through a New Keynesian Dynamic Stochastic General Equilibrium model in a closed economy. In this study, the coefficients of the proposed model are calibrated and estimated using the quarterly data of Iran's economy from 2004 to 2015. It shows that in the presence of sentiment, how stochastic impulses affect the main macroec...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012